<<<<<<< HEAD Alcohol ======= <<<<<<< HEAD Assignment 2 ======= Assignment2_draft >>>>>>> 7ee7f3ca4630f0a16e7d243144eb8a12b237717f >>>>>>> 452f1d31bff7bff857fd9e246d732a95ef8639d8 <<<<<<< HEAD ======= <<<<<<< HEAD ======= >>>>>>> 452f1d31bff7bff857fd9e246d732a95ef8639d8 <<<<<<< HEAD ======= >>>>>>> 7ee7f3ca4630f0a16e7d243144eb8a12b237717f >>>>>>> 452f1d31bff7bff857fd9e246d732a95ef8639d8
<<<<<<< HEAD
## package 'tidyverse' successfully unpacked and MD5 sums checked
## 
## The downloaded binary packages are in
##  C:\Users\Naveed Aamir\AppData\Local\Temp\RtmpK4oeg8\downloaded_packages
## package 'knitr' successfully unpacked and MD5 sums checked
## 
## The downloaded binary packages are in
##  C:\Users\Naveed Aamir\AppData\Local\Temp\RtmpK4oeg8\downloaded_packages
## package 'readr' successfully unpacked and MD5 sums checked
## 
## The downloaded binary packages are in
##  C:\Users\Naveed Aamir\AppData\Local\Temp\RtmpK4oeg8\downloaded_packages
## package 'dplyr' successfully unpacked and MD5 sums checked
## 
## The downloaded binary packages are in
##  C:\Users\Naveed Aamir\AppData\Local\Temp\RtmpK4oeg8\downloaded_packages
## package 'broom' successfully unpacked and MD5 sums checked
## 
## The downloaded binary packages are in
##  C:\Users\Naveed Aamir\AppData\Local\Temp\RtmpK4oeg8\downloaded_packages
## package 'forecast' successfully unpacked and MD5 sums checked
## 
## The downloaded binary packages are in
##  C:\Users\Naveed Aamir\AppData\Local\Temp\RtmpK4oeg8\downloaded_packages
## package 'repr' successfully unpacked and MD5 sums checked
## 
## The downloaded binary packages are in
##  C:\Users\Naveed Aamir\AppData\Local\Temp\RtmpK4oeg8\downloaded_packages
## package 'lubridate' successfully unpacked and MD5 sums checked
## 
## The downloaded binary packages are in
##  C:\Users\Naveed Aamir\AppData\Local\Temp\RtmpK4oeg8\downloaded_packages
## package 'janitor' successfully unpacked and MD5 sums checked
## 
## The downloaded binary packages are in
##  C:\Users\Naveed Aamir\AppData\Local\Temp\RtmpK4oeg8\downloaded_packages
## package 'plotly' successfully unpacked and MD5 sums checked
## 
## The downloaded binary packages are in
##  C:\Users\Naveed Aamir\AppData\Local\Temp\RtmpK4oeg8\downloaded_packages

1 ASSESSMENT 2

1.1 INTRODUCTION

While alcohol is often consumed for leisure, relaxation, and celebration, it unfortunately becomes a crutch for temporarily alleviating stress, grief, and sorrow. Furthermore, the consumption and types of alcohol have also become associated with social status. However, it is important to acknowledge that alcohol consumption can also lead to various disorders that may result in fatal consequences.

Thus, this report delves into the influence of income on alcohol consumption, explores the contributing variables to alcohol-related deaths, and discusses countries with the highest percentage of deaths attributed to alcohol use disorders.

2 Section Naveed

2.1 Question 1

  • Do people with higher income consume more alcohol and how does the consumption of alcohol vary with an increase in income?
options(scipen = 999)


# Wrangle and get the data ready for the model
alcohol2 <- filter(alcohol, GDP_per_capita>=0, GDP_per_capita<=70000) %>% select(Entity,GDP_per_capita,Total_alcohol_consumption_per_capita)

alcohol3 <- na.omit(alcohol2)

# scatter plot of consumption per person vs income
sp <- ggplot(data = alcohol3, aes(x = GDP_per_capita, y = Total_alcohol_consumption_per_capita)) +
  geom_point() + labs(title = "Pure Alcohol Consumption per person with Income",x = "GDP Per Capita (USD)",y = "Pure Alcohol Consumption per Capita (Litres)",subtitle = "Consumption of alcohol per person increases with Income and eventually becomes constant", caption = "Alcohol Consumption Data 2000-2020")



spi <- ggplotly(sp) %>% layout(title = list(text = paste0("Variation in Pure Alcohol Consumption per person with Income",
                                                          "<br>",
                                                          "<sup>",
                                                          "Consumption of alcohol per person increases with Income and eventually becomes constant",
                                                          "</sup>")), x = "Income per person (USD)", y = "Pure Alcohol Consumption per person (Litres)",
                               annotations = list(x=1, y=-0.1, text = "Alcohol Consumption Data 2000-2020", showarrow = F, xref = 'paper', yref = 'paper', xanchor = 'right', yanchor = 'auto', xshift = 0, yshift = 0, font = list(size=10, colour="red")))
spi

Figure 2.1: plot1

# *****
# Fit a simple linear regression model of consumption per person on income
con_fit <- lm(Total_alcohol_consumption_per_capita~GDP_per_capita, data=alcohol3)
#tidy(con_fit)
# Add predictions, residuals, etc. to the training data
con_model <- augment(con_fit, alcohol3)

# Fitting the estimated model over the data
con_mod <- ggplot(con_model, aes(x=GDP_per_capita, y=Total_alcohol_consumption_per_capita)) + 
  geom_point() + labs(title = "Linear Model of Pure Alcohol Consumption per person on Income",x = "GDP Per Capita (USD)",y = "Pure Alcohol Consumption per Capita (Litres)",subtitle = "The relationship between Consumption per person and Income is not linear", caption = "Alcohol Consumption Data 2000-2020") + geom_line(aes(y=.fitted), colour="blue")
  
con_modi <- ggplotly(con_mod) %>% layout(title = list(text = paste0("Linear Model of Pure Alcohol Consumption per person on Income",
                                                          "<br>",
                                                          "<sup>",
                                                          "The relationship between Consumption per person and Income is not linear",
                                                          "</sup>")), x = "GDP Per Capita (USD)", y = "Pure Alcohol Consumption per person (Litres)",
                               annotations = list(x=1, y=-0.1, text = "Alcohol Consumption Data 2000-2020", showarrow = F, xref = 'paper', yref = 'paper', xanchor = 'right', yanchor = 'auto', xshift = 0, yshift = 0, font = list(size=10, colour="red")))

options(scipen = 999)
con_modi

Figure 2.2: plot2

# *****
# Summary of the fitted linear regression model
con_lin_fit <- lm(formula = Total_alcohol_consumption_per_capita ~ GDP_per_capita, data = alcohol3)

# Return a summary of the fitted linear regression model
summary(con_lin_fit)
## 
## Call:
## lm(formula = Total_alcohol_consumption_per_capita ~ GDP_per_capita, 
##     data = alcohol3)
## 
## Residuals:
##      Min       1Q   Median       3Q      Max 
## -10.9376  -2.4253   0.0752   2.3196  13.2824 
## 
## Coefficients:
##                   Estimate  Std. Error t value            Pr(>|t|)    
## (Intercept)    4.322541227 0.226113925   19.12 <0.0000000000000002 ***
## GDP_per_capita 0.000105162 0.000009377   11.21 <0.0000000000000002 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 3.683 on 568 degrees of freedom
## Multiple R-squared:  0.1813, Adjusted R-squared:  0.1798 
## F-statistic: 125.8 on 1 and 568 DF,  p-value: < 0.00000000000000022
# *****
# Examine outliers
alc_outliers <- arrange(alcohol3, desc(Total_alcohol_consumption_per_capita))
# *****
# Table that displays five countries each for both highest gdp per capita and lowest gdp per capita
alcohol10 <- arrange(alcohol3, desc(GDP_per_capita))

alcohol_gdp <- filter(alcohol10, Entity %in% c("Switzerland", "Brunei", "United Arab Emirates", "Norway", "United States", "Burundi", "Central African Republic", "Democratic Republic of Congo", "Somalia", "Mozambique"))

knitr::kable(alcohol_gdp, caption = "Some countries with one of the highest and lowest GDP Per Capita and their corresponding alcohol consummption per capita")
Table 2.1: Some countries with one of the highest and lowest GDP Per Capita and their corresponding alcohol consummption per capita
Entity GDP_per_capita Total_alcohol_consumption_per_capita
Switzerland 68025.9219 11.580
Brunei 67753.4219 0.510
United Arab Emirates 66968.2188 3.900
Switzerland 65819.7344 11.410
United Arab Emirates 65267.4141 3.790
Norway 64341.2578 7.410
Brunei 63147.4844 0.460
Norway 62987.5156 7.520
United States 61585.7578 9.870
Norway 61353.4766 8.940
Brunei 60389.1836 0.480
United States 58540.2969 9.820
United Arab Emirates 54921.7969 3.150
United States 54315.9141 9.360
Mozambique 1289.7360 2.300
Mozambique 1262.6132 2.190
Central African Republic 1201.3522 3.980
Somalia 1129.5315 0.009
Democratic Republic of Congo 1085.8937 2.000
Democratic Republic of Congo 1065.2424 2.050
Mozambique 1027.2089 1.630
Somalia 1026.0382 0.013
Central African Republic 933.1099 2.380
Democratic Republic of Congo 865.6840 1.750
Central African Republic 852.7492 2.430
Burundi 846.1691 7.660
Burundi 825.2057 7.030
Burundi 761.5241 7.190
# *****
  • The scatter plot in (Figure 2.1) does show that, in general, consumption of alcohol is found to be higher for people with higher income.

  • At first, the consumption of alcohol increases exponentially as income increases and then becomes approximately constant around and beyond a GDP Per Capita of USD 20000.

  • The summary table of the linear regression model of consumption per person on income in shows that for an increase of USD 1000 in income the consumption of alcohol per person increases by 105.2 ml

  • The overall p-value and the one for the variable GDP Per Capita is much lower than the usual significance level of 5%. which shows that alcohol consumption per person is dependant on income.

  • However, the linear regression model only explains about 18.13% of the variation in alcohol consumption per person, which suggests that the relationship between alcohol consumption per person and income may not be linear and the model is not very accurate.

  • This is confirmed by the plot of the fitted linear regression model in (Figure 2.2), which also shows that most of the observations lie above the line of best fit and suggests a curved relationship instead.

  • Therefore, the relationship is not linear and the model can be optimized further.

  • With a GDP Per Capita of approximately USD 27500, Seychelles is an outlier Ritchie et al. [2018]. and has the highest alcohol consumption per person of 20.5 L, further research may confirm if this because Seychelles is a very popular tourist and retirement spot.

  • With a GDP Per Capita of approximately USD 27600, Malaysia is also an outlier Ritchie et al. [2018]. and has one of the lowest alcohol consumption per person of 0.85 L, further research may confirm if this because of religious influence, lack of preference and a lack of affordability in Malaysia.

  • The table in 2.1 shows countries with various GDP Per Capita and reiterates the fact that countries with higher income such as Switzerland, Norway and the United States have a higher alcohol consumption per person compared to countries with lower income such as Congo, Somalia and Mozambique. This table also confirms the earlier analysis that after a particular income level the alcohol consumption per person is relatively constant.

  • However, further research into the outlier Burundi with a GDP Per Capita of approximately only USD 800 but a nearly 400% higher alcohol consumption per person compared to the other lower income countries mentioned above, may suggest it to be the result of cultural influence.

  • Similarly, both the outliers UAE and Brunei as compared to the other countries with high income mentioned above have a comparatively low alcohol consumption per person. Further research may help to determine if this a result of religious influence in the respective countries.

2.2 Question 2

  • Is death dependant on consumption, income and alcohol household expense? Which countries had the highest percentage of deaths from alcohol use disorders in 2015? Are there any interesting findings and any possible potential underlying reasons?
# select variables for regression
selected.var <- c(4, 6, 11, 12)
# partition data
set.seed(1)  # set seed for reproducing the partition
train.index <- sample(c(1:3618), 2000) 
train.df <- alcohol[train.index, selected.var]
test.df <- alcohol[-train.index, selected.var]

# death regress on consumption, income and alcohol household expense
death_m <- lm(Death_alcohol_use_disorders ~., data = train.df)
options(scipen = 999)
summary(death_m)
## 
## Call:
## lm(formula = Death_alcohol_use_disorders ~ ., data = train.df)
## 
## Residuals:
##     Min      1Q  Median      3Q     Max 
## -5.7650 -2.3255 -0.5326  1.9368  6.2142 
## 
## Coefficients:
##                                           Estimate  Std. Error t value Pr(>|t|)
## (Intercept)                            -3.32124062  2.44233274  -1.360 0.190664
## GDP_per_capita                         -0.00015013  0.00005195  -2.890 0.009751
## Alcohol_expenditure_rate_per_household  0.49992440  1.11396453   0.449 0.658947
## Total_alcohol_consumption_per_capita    1.17868767  0.28556502   4.128 0.000632
##                                           
## (Intercept)                               
## GDP_per_capita                         ** 
## Alcohol_expenditure_rate_per_household    
## Total_alcohol_consumption_per_capita   ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 3.787 on 18 degrees of freedom
##   (1978 observations deleted due to missingness)
## Multiple R-squared:  0.5665, Adjusted R-squared:  0.4942 
## F-statistic: 7.841 on 3 and 18 DF,  p-value: 0.001487
# *****
# % death in alcohol use disorders vs alcohol consumption per capita
sp_death_con <- ggplot(data = alcohol, aes(x = Total_alcohol_consumption_per_capita, y = Death_alcohol_use_disorders)) +
  geom_point() + labs(title = "Variation in death in alcohol use disorders",x = "Pure Alcohol Consumption per Capita (Litres)",y = "Death in alcohol use disorders(%)",subtitle = "Percentage death increases with an increase in alcohol consumption per person", caption = "Alcohol Consumption Data 2000-2020")



sp_death_coni <- ggplotly(sp_death_con) %>% layout(title = list(text = paste0("Variation in death in alcohol use disorders",
                                                          "<br>",
                                                          "<sup>",
                                                          "Percentage death increases with an increase in alcohol consumption per person",
                                                          "</sup>")), x = "Pure Alcohol Consumption per Capita (Litres)", y = "Death in alcohol use disorders(%)",
                               annotations = list(x=1, y=-0.1, text = "Alcohol Consumption Data 2000-2020", showarrow = F, xref = 'paper', yref = 'paper', xanchor = 'right', yanchor = 'auto', xshift = 0, yshift = 0, font = list(size=10, colour="red")))


sp_death_coni

Figure 2.3: plotA

# % death in alcohol use disorders vs GDP Per Capita
sp_death_gdp <- ggplot(data = alcohol, aes(x = GDP_per_capita, y = Death_alcohol_use_disorders)) +
  geom_point() + labs(title = "Variation in death in alcohol use disorders with increase in income",x = "GDP Per Capita (USD)",y = "Death in alcohol use disorders(%)",subtitle = "Percentage death decreases with an increase in GDP Per Capita", caption = "Alcohol Consumption Data 2000-2020")

sp_death_gdpi <- ggplotly(sp_death_gdp) %>% layout(title = list(text = paste0("Variation in death in alcohol use disorders with increase in income",
                                                          "<br>",
                                                          "<sup>",
                                                          "Percentage death decreases with an increase in GDP Per Capita",
                                                          "</sup>")), x = "GDP Per Capita (USD)", y = "Death in alcohol use disorders(%)",
                               annotations = list(x=1, y=-0.1, text = "Alcohol Consumption Data 2000-2020", showarrow = F, xref = 'paper', yref = 'paper', xanchor = 'right', yanchor = 'auto', xshift = 0, yshift = 0, font = list(size=10, colour="red")))


sp_death_gdpi

Figure 2.4: plotB

# *****
# % death in alcohol use disorders vs household alcohol expense
sp_death_exp <- ggplot(data = alcohol, aes(x = Alcohol_expenditure_rate_per_household, y = Death_alcohol_use_disorders)) +
  geom_point() + labs(title = "Variation in death in alcohol use disorders with increase in household alcohol expense",x = "Household Alcohol Expense (%)",y = "Death in alcohol use disorders(%)",subtitle = "Percentage death increases with an increase in houeshold alcohol expense", caption = "Alcohol Consumption Data 2000-2020")

sp_death_expi <- ggplotly(sp_death_exp) %>% layout(title = list(text = paste0("Variation in death in alcohol use disorders with increase in household alcohol expense",
                                                          "<br>",
                                                          "<sup>",
                                                          "Percentage death increases with an increase in houeshold alcohol expense",
                                                          "</sup>")), x = "Household Alcohol Expense (%)", y = "Death in alcohol use disorders(%)",
                               annotations = list(x=1, y=-0.1, text = "Alcohol Consumption Data 2000-2020", showarrow = F, xref = 'paper', yref = 'paper', xanchor = 'right', yanchor = 'auto', xshift = 0, yshift = 0, font = list(size=10, colour="red")))


sp_death_expi

Figure 2.5: plotC

# *****
# countries in 2015
countries_2015 <- filter(alcohol, Year == 2015) %>% select(Entity, Death_alcohol_use_disorders, Total_alcohol_consumption_per_capita, GDP_per_capita, Alcohol_expenditure_rate_per_household) 

# top 7 countries with highest percentage of alcohol related deaths in 2015
countries_deaths <- filter(countries_2015, Death_alcohol_use_disorders >= 12.5, Death_alcohol_use_disorders <= 22) 

countries_deaths_highest <-  arrange(countries_deaths, desc(Death_alcohol_use_disorders))

knitr::kable(countries_deaths_highest, caption = "The top 7 countries with the highest percentage of deaths in alcohol use disorders in 2015")
======= <<<<<<< HEAD

Introduction

Question 1

  • How has the total alcohol consumption per capita changed over time in relation to geographical regions?

Analysis

Table 1 presented the overall median global growth of alcohol consumption from 2000 to 2018. However, Figure 1 revealed that regions were still experiencing diverging patterns during this period. Regions such as ‘East Asia and Pacific’, ‘South Asia’, ‘North America’ and ‘Middle East and North Africa’ increased in alcohol consumption, while other regions like ‘Europe and Central Asia’, ‘Latin America and Caribbean’ and ‘Sub-Saharan Africa’ experienced declines.

Table 1 provided insights of the average (6.18L) and median (6.11L) alcohol consumption in 2010, this year stands as the highest in total consumption. Similarly, Figure 2 presents the visual depiction of countries around the world reaching their highest in 2010, followed by subsequent decrease.


Table 1

Table 1 displays the IQR, the range, Mean and Median values of total alcohol consumption in 2000, 2005, 2010, 2015 and 2018

Table 2.2: The top 7 countries with the highest percentage of deaths in alcohol use disorders in 2015
Entity Death_alcohol_use_disorders Total_alcohol_consumption_per_capita GDP_per_capita Alcohol_expenditure_rate_per_household
Table 1: Alcohol Consumption Statistics throughout the Years
Statistics 2000 2005 2010 2015 2018
Min. 0 0.018999999 0.032000002 0.003 0.003
1st Qu. 2.410000086 2.309999943 2.470000029 2.38499993125 2.3975000975
Median 5.488336086 5.369999886 6.110000134 5.8805630205 5.7849998475
Mean 6.06807314336548 6.12577095275635 6.18171174677114 6.11046166576238 6.03252799644059
3rd Qu. 9.149999619 9.359999657 9.649999619 9.564999819 9.260000229
Max. 17.45000076 19.95000076 17.43000031 18.35000038 20.5

Figure 1

Figure 1 displays the trends in total alcohol consumption across geographical regions from 2000 to 2018, with the World’s rate shown as a dot-dashed line

  • ‘Europe and Central Asia’ region experienced a notable decrease in consumption, while ‘Latin America and Caribbean’ and ‘Sub-Saharan Africa’ regions saw slight declines

  • ‘East Asia and Pacific’ and ‘South Asia’ regions presented rapid growth, particularly between 2005 and 2010

  • ‘North America’ and ‘Middle East and North Africa’ regions had minor increases, however the ‘Middle East and North Africa’ remained the lowest

Region Chart

Figure 1: Region Chart


Question 2

  • How does the influence of religion relate to variations in total alcohol consumption per capita across different locations?

Analysis

Figure 1 highlighted the region ‘Middle East and North Africa’ as having had experienced the least growth and changes from 2000 to 2018 in comparison to the others. Notably, ‘Middle East and North Africa’ consistently exhibited the lowest alcohol consumption (consuming less than 1L each year). Similarly, Figure 2 shows the geographical locations: the Middle East, North Africa and the Greater Indonesia to have had minor changes over the years.

Table 2 demonstrates consistency among the lowest 10 countries in alcohol consumption over the years, with only minor variations. Interestingly, a significant number of these countries are known for their devotion of Islamic values.

Although the impact of religion on alcohol consumption cannot be definitively determined without further global religious information. Figure 1, Table 2 and Figure 2 suggest that these locations (the Middle East, North Africa and the Greater Indonesia) may experience lower alcohol consumption due to religious beliefs.


Figure 2

Figure 2 presents the alcohol consumption per capita in 2000, 2005, 2010, 2015 and 2018. Some notable observations are:

  • In 2000, Russia showed high alcohol consumption but over time it declined to 11.19L in 2018

  • In 2010, most countries had their highest rate of alcohol consumption

  • From 2000 to 2018, the majority of North Africa, the Middle East and the Greater Indonesia regions presented minimal changes in alcohol consumption - this may be attributed by Islamic beliefs as the Quran deems drinking as haram


Table 2

Table 2 2 presents the lowest 10 countries in 2000, 2005, 2010, 2015 and 2018 in alcohol consumption

Table 2: Bottom 10 Countries in Alcohol Consumption (Litres)
Entity 2000 2005 2010 2015 2018
Brunei 0.230000004 0.209999993
Niger 0.230000004 0.25
Iran 0.209999993
Mauritania 0.200000003 0.129999995 0.086000003 0.033 0.035999998
Comoros 0.189999998
0.25
Saudi Arabia 0.180000007 0.170000002 0.170000002 0.200000003 0.189999998
Pakistan 0.085000001 0.159999996 0.189999998 0.289999992 0.340000004
Libya 0.064000003 0.082999997 0.097999997 0.028999999 0.018999999
Kuwait 0.037999999 0.018999999 0.032000002 0.003 0.003
Somalia 0 0.068000004 0.280000001 0.013 0.009
Egypt
0.289999992
Bangladesh
0.159999996 0.159999996 0.021 0.018999999
Afghanistan
0.209999993 0.209999993 0.209999993
Yemen
0.180000007 0.055 0.050999999
Syria
0.280000001 0.239999995
=======
library(tidyverse)
library(plotly)
library(bookdown)
library(rnaturalearth)
library(rnaturalearthdata)
library(maps)
library(dslabs)
library(knitr)
library(scales)
library(dplyr)
library(gridExtra)
#Load merged data
data_init <- read.csv("Alcohol/Merge_all_2010_2020_updated.csv")

#Subset dataset for the section
data_init_mod <- data_init %>%
  select(c("Entity", "Year", "Code", "Death_alcohol_use_disorders"))

1 Introduction

1.1 Gloria’section

1.1.1 Research questions

Consuming too much alcohol and not being able to control drinking can harm an individual’s physical or mental health and safety, causing to family or financial problems. Alcohol use disorders widely affect people across the globe. In this section, we will examine the death caused by alcohol use disorders directly worldwide and attempt to answer the following two questions:

  1. Which countries worldwide were most affected by alcohol use disorders?

  2. Has the situation improved or got worse over the past ten years?

1.1.2 Countries in cold areas tend to be more affected by alcohol use disorders.

Belarus and Mongolia had the highest direct death rates (not including indirect death from suicide) of 21.80 and 17.05 per 100,000 people, respectively, followed by Russia, El Salvador, and Greenland (Table 1.1).

Most of the countries with high death rates are in the subarctic or arctic zones in the northern hemisphere (Figure 1.1). This is in line with findings from section 1. The cold and dark climates in these countries might have contributed to this observation.

However, countries in hot and humid weather such as Guatemala and Brazil also had high death rates from alcohol abuse. Countries in the southern subarctic zone with high alcohol consumption levels only showed low to moderate death rates. Therefore, cultural, genetic, historical, and religious factors cannot be ignored when investigating the underlying reason of high death rates in these countries.

1.1.2.1 Plot

Figure 1.1: Annual Death rates from alcohol

1.1.2.2 Table

>>>>>>> 452f1d31bff7bff857fd9e246d732a95ef8639d8 <<<<<<< HEAD
Table 1.1: The top 20 countries with highest annual number of death caused by alcohol use disorders.
Entity Mean
Belarus21.86 12.00 18362.746 NA
Mongolia 16.48 10.97 11009.715 NA
El Salvador 15.01 3.59 8143.326 NA
Russia 14.20 12.47 25488.096 1.8
Greenland 13.18 NA NA NA
Saint Kitts and Nevis 13.13 8.88 23966.498 NA
Guatemala 12.58 2.55 8125.656 NA
# *****
  • The Multiple Linear Regression model of percentage death (in alcohol use disorders) regressed on consumption, income and household alcohol expense has been used to investigate Question 2.

  • The model summary table, shows that a 1 Litre increase in annual pure alcohol consumption per person increases the death percentage in alcohol use disorder by 1.179%.

  • However, an increase of USD 10000 in GDP Per Capita decreases the percentage death in alcohol use disorder by 1.5% and an increase of 1% in the weekly household alcohol expense increases the percentage death in alcohol use disorder by 0.5%

  • The overall p-value is lower than the usual significance value of 5%, which suggests that percentage death in alcohol use disorders is dependant on the predictors present in the model.

  • The Multiple R-Squared value in the model summary table shows that the model explains 56.65% of the variation in percentage deaths in alcohol use disorders. Therefore, the model may be considered to be accurate, however, there is space for improvement.

  • The scatter plot in (Figure 2.3) of deaths from alcohol use disorders vs alcohol consumption per person shows that at first the percentage death increases slightly with an increase in consumption but then it increases exponentially beyond approximately 7.5 litres.

  • The scatter plot in (Figure 2.4) of deaths from alcohol use disorders vs GDP Per Capita shows that the percentage death is higher for countries with low GDP Per Capita and then the percentage death decreases exponentially as GDP Per Capita increases till it eventually almost becomes constant.

  • Further research may determine whether this decline in percentage deaths may be a result of the availability of safer and better quality alcohol for consumption, better individual health, quality of healthcare and affordability of better healthcare. These may also be some of the additional predictors that may help explain the remaining 55% of variation in deaths from alcohol use disorders.

  • The scatter plot in (Figure 2.5) of deaths from alcohol use disorders vs household alcohol expense shows that the percentage death is about constant at first but then increases exponentially beyond a household alcohol expense of 1.5%. This is contrary to the earlier description of the relationship between the same two variables by the Multiple Linear Regression model. Some of the observed outliers may be a result of the additional predictors mentioned above. This scatter plot is also reflective of the deaths from alcohol use disorders vs alcohol consumption per person in Fig B and, therefore, reaffirms that percentage death is dependant on the alcohol consumption per person.

  • The above scatter plots also help explain why the Multiple Linear Regression model used explains only approximately 57% of the variation in percentage deaths in alcohol use orders because the relation between the dependant and independant variables does not appear to be linear. Therefore, the model should be optimized further.

  • The table in 2.2, shows two of the outliers El Salvador and Gautemala, both of which had one of the highest percentage deaths in alcohol use disorders in 2015 but one of the lowest alcohol consumption per capita of 3.59 litres and 2.55 litres respectively. Further research may help determine whether some of the additional predictors mentioned above such as consumption of unsafe and low quality alcohol, comparatively poorer individual health, lack of quality and affordable healthcare contributed to the high percentage of deaths in alcohol use disorders for these countries.

3 Section Jane

3.1 Question 1

3.2 Question 2

4 Section Tony

4.1 Question 1

4.2 Question 2

5 Section Gloria

5.1 Question 1

5.2 Question 2

6 Conclusion

Lastly, the conducted analysis does suggest that an increase in personal income does increase the tendency to consume more alcohol. However, maybe because of a limit on how much alcohol can be consumed by a person, the consumption of alcohol does not increase further beyond a certain income threshold. The exploration of the data available suggests that percentage death in alcohol use disorders increases with an increase in alcohol consumption and household alcohol expense, which would make sense as both alcohol consumption and alcohol expense are directly related. On the contrary, percentage deaths from alcohol use disorders actually decreases with an increase in GDP Per Capita, which is also reiterated when we explored some of the countries with the highest percentage of deaths from alcohol use disorders in 2015. The insights obtained in this report are suggestive that death may be dependent on additional predictors such as consumption of unsafe and low quality alcohol, comparatively poorer individual health, lack of quality and affordable healthcare and provides highly relevant topics for future research.

As we explored the relationship between income and alcohol consumption, as well as factors leading to alcohol-related deaths, valuable insights were discovered surrounding this issue although there may be limitations with external factors not considered in this report. By recognizing the relationship between geographical regions, social factors, and alcohol consumption, we can develop targeted interventions and strategies to mitigate the negative effects of alcohol use disorders.

7 References

8 Appendix

======= 21.80 Mongolia 17.05 Russia 14.88 El Salvador 14.55 Greenland 13.76 Guatemala 13.44 Saint Kitts and Nevis 12.95 Estonia 12.62 Ukraine 12.53 Latvia 10.59 Kazakhstan 9.92 Moldova 9.40 Lithuania 9.04 Denmark 8.95 Poland 8.30 Nicaragua 8.20 United States Virgin Islands 7.94 Finland 7.14 Kyrgyzstan 7.06 Antigua and Barbuda 6.21

1.1.3 The numbers fluctuate, but we have seen some improvements.

Table 1.2 shows the standard deviations of %death from alcohol per country from 2010-2019. Most countries with high variances also had high average %death rates such as Kazakhstan, Guatemala, Russia, Mongolia, etc.

In figure 1.2, rate groups are classified by the average annual %death rates over 2010-2019 into low (group1, <=1st quarter), medium (group2, <3rd quarter & >1st quarter) and high (group3, >=3st quarter). Only the top ten countries are shown in each group. Note that the y axis are different in each subplot. This is to illustrate the trends in each group.

Zooming in onto the global trends, we can see that:

  • Most countries with both medium and high average annual %death from alcohol showed some improvements with a decreased %death rate.

  • Countries with low average annual %death from alcohol had more fluctuations and some even showed a slight increase.

1.1.3.1 Plot

Figure 1.2: Changes of death rates from alcohol in selected countries 2010-2019

1.1.3.2 Table

Table 1.2: The top 20 countries with highest variance in number of death caused by alcohol use disorders.
Entity Standard_deviation
Kazakhstan 1.89
Guatemala 1.30
Russia 1.26
Mongolia 1.22
Greenland 1.03
Estonia 1.02
Lithuania 0.96
Moldova 0.86
Paraguay 0.84
Saint Kitts and Nevis 0.74
Ukraine 0.70
Finland 0.68
El Salvador 0.66
Kyrgyzstan 0.53
Tajikistan 0.46
Nicaragua 0.45
Belarus 0.42
Ecuador 0.42
Denmark 0.40
Turkmenistan 0.37

1.2 Tony’s section

data <- read.csv("Merge_all_2010_2020_updated.csv")

alc_sex_regions <- data %>%
  select(Entity, Year, Prevalence_alcohol_use_disorders_male, 
         Prevalence_alcohol_use_disorders_female) %>%
  na.omit() %>%
  filter(Entity %in% c("African Region (WHO)", "Australia", "China", 
                       "European Region (WHO)", "United Kingdom", 
                       "United States"))

alc_sex_income <- data %>%
  select(Entity, Year, Prevalence_alcohol_use_disorders_male, 
         Prevalence_alcohol_use_disorders_female) %>%
  na.omit() %>%
  filter(Entity %in% c("World Bank Low Income", "World Bank Lower Middle Income", 
                       "World Bank Upper Middle Income", "World Bank High Income"))

2 What are the variations in the prevalence of alcohol disorders among men and women across different income groups?

Table 2.1: % of Males and Females with Alcohol Use Disorders in 2010 and 2019
Entity Year Prevalence_alcohol_use_disorders_male Prevalence_alcohol_use_disorders_female
World Bank High Income 2010 2.92 1.31
World Bank High Income 2019 2.86 1.29
World Bank Low Income 2010 2.04 0.56
World Bank Low Income 2019 2.06 0.57
World Bank Lower Middle Income 2010 1.91 0.40
World Bank Lower Middle Income 2019 1.77 0.39
World Bank Upper Middle Income 2010 2.37 0.70
World Bank Upper Middle Income 2019 2.42 0.65

An association between income and alcohol disorders reveals noteworthy deviations from 2010 to 2019 but it is also important to consider the influence of other socioeconomic factors. There is a noticeable distinction between the high-income group and other income groups. The percentage of alcohol disorders is significantly greater or approximately doubled when compared with other income groups. Females in the high-income group may be subject to societal expectations or gender roles which lead to increased alcohol related issues. There is a consistent trend indicating a higher prevalence of alcohol disorders among individuals from the higher income groups possibly with work pressures. Interestingly, the income groups among both males and females both have the lower-middle income group with the lowest rate of alcohol disorders. This may be due to considerations such as lower financial stressors and mental and physical well-being. However, the low and upper-middle income groups have a higher rate particularly among the males. Similarly, this may be due to financial and work stressors.

3 What is the temporal trend in the prevalence of alcohol disorders among men and women across major regions?

Table 3.1: % of Males and Females with Alcohol Use Disorders in 2010 and 2019
Entity Year Prevalence_alcohol_use_disorders_male Prevalence_alcohol_use_disorders_female
African Region (WHO) 2010 1.70 0.58
African Region (WHO) 2019 1.68 0.58
Australia 2010 2.51 1.33
Australia 2019 2.72 1.41
China 2010 1.91 0.45
China 2019 2.17 0.40
European Region (WHO) 2010 3.47 1.39
European Region (WHO) 2019 3.36 1.33
United Kingdom 2010 4.83 1.40
United Kingdom 2019 5.48 1.48
United States 2010 3.28 1.88
United States 2019 3.22 1.80

Across all regions, it is clear there is a higher percentage of males with alcohol disorders than females and this has not changed significantly over the period 2010 to 2019. However, there is no evident similar trends between each region as there are various attributable factors such as evolving social norms, increased alcohol availability and alterations in cultural traditions. Regardless, it is apparent over this period, males with alcohol issues are increasing in Australia, China and the United Kingdom.

>>>>>>> 7ee7f3ca4630f0a16e7d243144eb8a12b237717f >>>>>>> 452f1d31bff7bff857fd9e246d732a95ef8639d8